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Abstract. Labeled unranked trees are used as a model of XML documents, and logical 
languages for them have been studied actively over the past several years. Such logics 
have different purposes: some are better suited for extracting data, some for expressing 
navigational properties, and some make it easy to relate complex properties of trees to 
the existence of tree automata for those properties. Furthermore, logics differ significantly 
in their model-checking properties, their automata models, and their behavior on ordered 
and unordered trees. In this paper we present a survey of logics for unranked trees. 



1. Introduction 

Trees arise everywhere in computer science, and there are numerous formahsms in the ht- 
erature for describing and manipulating trees. Some of these formahsms are declarative 
and based on logical specifications: for example, first-order logic, monadic second-order 
logic, and various temporal and fixed-point logics over trees. Others are procedural for- 
malisms such as various flavors of tree automata, or tree transducers, or tree grammars. All 
these formalisms have found numerous applications in verification, program analysis, logic 
programming, constraint programming, linguistics, and databases. 

Until recently, most logical formalisms for trees dealt with ranked trees |C(j-|-02l ITho97j : 
in such trees, all nodes have the same fixed number of children (or, a bit more generally, 
the number of children of a node is determined by the label of that node). Over the past 
several years, however, the focus has shifted towards unranked trees, in which there are no 
restrictions on the number of children a node can have. For example, the left tree in Figure^ 
is a binary tree in which every non-leaf node has two children. In the second tree in Figure^ 
however, different nodes have a different number of children. Although unranked trees have 
been considered in the 60s an d 70s | PQ68,, Tak75| IThaBTj . and are related to feature trees 
over an infinite set of features |Smo92j which are a particular kind of feature structures that 
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Figure 1: A ranked (binary) and an unranked tree 



have been investigated by computational linguists |Bla94l ICar921 IRK86j , their systematic 
study was initiated by the development of XML (extensible Markup Language). XML 
is a data format which has become the lingua franca for information exchange on the 
World Wide Web. In particular, XML data is typically modeled as labeled unranked trees 
|JNevn2llVmnT] . 

This connection has led to a renewed interest in logical and procedural formalisms for 
unranked trees. Since XML trees are used to exchange data, the usual database query 
language paradigms apply: one uses logical formalisms for expressing declarative queries, 
and procedural formalisms for evaluating those declarative queries. Logics over unranked 
trees defining a variety of query languages for them appeared in large numbers over the past 
7-8 years, and they come in many flavors and shapes. What is common to them, however, 
is a close connection to automata models, and quite often to temporal and modal logics, 
especially when one describes properties of paths through a document. 

Let us now review some of the parameters according to which logics for unranked trees can 
be classified. 

The yardstick logic: Most formalisms are "inspired" by one of the two logics often 
used in the context of trees: first-order logic (FO), and monadic second-order logic 
(MSO) that extends FO by quantification over sets of nodes. Query languages 
and schema formalisms for XML tend to use MSO as the yardstick: for example, 
XML Document Type Definition (DTDs, or, more precisely, XSD - XML Schema 
Definition) are essentially equivalent to MSO sentences, and various languages for 
extraction of data from XML documents, although being syntactically very different, 
have the power of MSO unary queries. On the other hand, navigational aspects of 
XML, in particular, logics capturing various fragments of XPath, are usually closely 
related to FO and its fragments. 

Arity of queries: Most commonly one considers Boolean or unary queries. Boolean 
queries are logical sentences and thus evaluate to true or false. For example, checking 
if an XML document conforms to a schema specification is represented by a Boolean 
query. Unary queries correspond to formulae in one free variable, and thus produce a 
set of nodes. For example, extracting sets of nodes, or evaluating XPath expressions 
relative to the root naturally give rise to unary queries. 

Complexity of model-checking/query-evaluation: The model-checking problem 
asks whether a tree T satisfies a logical sentence (p, written T \= (p. If (/9 is an MSO 
sentence ip, it can be evaluated in linear time in the size of T, by converting ip to a 
tree automaton. But there is a price to pay: in terms of the size of p, the complexity 
becomes non-elementary. This type of trade-off is one of the central issues in dealing 
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with logics over trees. Similar issues arise with evaluating formulae (^(x) in trees, 
that is, finding tuples s of nodes such that T \= ip{s). 
Ordered vs. unordered trees: In the standard definition of unranked trees in the 
XML context, children of the same node are ordered by a sibling ordering. If such 
an order is present, we speak of ordered unranked trees. In many cases, however, 
this ordering is irrelevant, and some unranked tree models, such as feature trees, 
do not impose any ordering on siblings. There is considerable difference between 
the expressiveness of logics and automata models depending on the availability of 
sibling ordering. The presence of ordering also affects the yardstick logic, since 
without order often counting is needed to match the power of automata models 

The paper is organized as follows. After we give basic definitions in Section |21 we move to 
logics for ordered trees. In Section |31 we deal with MSO-related logics, including syntactic 
restrictions of MSO, a datalog-based logic, and the /i-calculus. In Section we turn to 
FO-related logics, present analogs of LTL and CTL"^ that have been studied for expressing 
navigational properties, and also look at conjunctive queries over trees. In Section[Slwe turn 
to trees that lack the sibling ordering, and show that in many logics some form of counting 
needs to be added to compensate for the missing ordering. We also review ambient and 
feature logics over edge-labeled trees. In Section|Slwe look at the model-theoretic approach. 
We consider an infinite first-order structure whose universe is the set of all unranked trees 
and obtain some well-known classes of trees by studying first-order definability (in the classic 
model-theoretic sense) over that structure. 

2. Trees, logics, and automata 

2.1. Tree domains, trees, and operations on trees. Nodes in unranked trees are ele- 
ments of N* - that is, finite strings whose letters are natural numbers. A string s = uqUi . . . 
defines a path from the root to a give node: one goes to the ngth child of the root, then to 
the nith child of that element, etc. We shall write si • S2 for the concatenation of strings si 
and S2, and e for the empty string. 

We now define some basic binary relations on N*. The child relation is 

s -<ch s' ^ s' = s ■ i for some i S N. 

The next-sibling relation is given by: 

s ^ns s' <^ s = So ■ i and s' = sq ■ {i + 1) for some sq G N* and i € N. 

That is, s and s' are both children of the same sq € N*, and s' is next after s in the natural 
ordering of siblings. We also use the first child relation: s -<{c s ■ 0. These are shown in 
Figure [21 

We shall use * to denote the reflexive-transitive closure of a relation. Thus, -<*j^ is the 
descendant relation (including self): s -<*^ s' iff s is a prefix of s' or s = s'. The transitive 
closure of the next-sibling relation, is a linear ordering on siblings: s ■ i s • j iff i < j. 
We shall be referring to younger / older siblings with respect to this ordering (the one of the 
form s ■ is the oldest). 
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next-sibling -< 



Figure 2: Basic relations for unranked trees 

A set Z? C N* is called prefix-closed if whenever s G D and s' is a prefix of s (that is, 

s' s), then s' £ D. 

Definition 2.1 (Tree domain). A tree domain D is a finite prefix-closed subset of N* such 
that s ■ i G D implies s ■ j € D for all j < i. 

Let S be a finite alphabet. We define trees as structures that consist of a universe and a 

number of predicates on the universe. 

Definition 2.2 (S-trees). An ordered unranked tree T is a structure 

r=(A<h,<s,(^a)aes), 

where D is a tree domain, -<*^ and -<*s are the descendant relation and the sibling ordering, 
and the Pq's are interpreted as disjoint sets whose union is the entire domain D. 

An unordered unranked tree is defined as a structure {D, -<*^, {Pa)ae's)j where D, -<*^, and 
Pa's are as above. 

Thus, a tree consists of a tree domain together with a labeling on its nodes, which is captured 
by the Pa predicates: if s G Pa, then the label of s is a. In this case we write Aj'(s) = a. 

Notice that when dealing with unranked trees we assume that each node has one label. 
Later we shall see a connection with temporal logics, where such a restriction on labeling 
is normally not imposed. However, one could always assume unique labeling in that case 
too, simply by collecting the set of all labels of a node (in this case the labeling alphabet 
becomes 2^). 

2.2. First-order and monadic second-order logic. We shall only consider relational 
vocabularies, that is, finite lists (i?i, . . . , Rm) of relation symbols, each Ri with an associated 
arity Ui. Over trees, relation symbols will be either binary (e.g., ^ch, ^ns, ^*h) or unary 
(the Pa's for a G E). 

Formulae of first-order logic (FO) arc built from atomic formulae x = x' , and R{x), where 
X, x' are variables, and a; is a tuple of variables whose length equals the arity of R, using the 
Boolean connectives V, A, -> and quantifiers 3 and V. If a formula tp has free variables x, we 
shall write ip{x). Formulae are evaluated on a structure, which consists of a universe and 
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interpretations for relations. Quantifiers 3 and V range over the universe of the structure. 
For example, an FO formula 

ip{x) = Pa{x) A3y3z{x ^l^^y Ay ^l^z APb{y) APc{z)) 

is true for nodes s in a tree T that are labeled a, have a descendant labeled b, which in turn 
has a younger sibling labeled c. 

Formulae of monadic second-order logic (MSO) in addition allow quantification over sets. 
We shall normally denote sets of nodes by upper case letters. Thus, MSO formulae have the 
usual first-order quantifiers 3x(p and Wxcp as well as second-order quantifiers 3Xlp and WXcp, 
and new atomic formulae X{x), where X is a second-order variable and x is a first-order 
variable. An MSO formula may have both free first-order and second-order variables. If it 
only has free first-order variables, then it defines a relation on the universe of the structure. 
As an example, an MSO formula ^odd{x,y) given by the conjunction of x y and 

( Vz((x<,z<,y) ^ (X(z) w^y(z))) \ 
3X3Y A {X{x)AY{y)) 

^ A VzVt; (x z ^ch V y - {(Xiz) ^ Yiv)) A (Viz) ^ Xiv)))) ^ 

says that y is a descendant of x and the path between them is of odd length. It says that 
there exist two sets, X and Y, that partition the path from x to y, such that x £ X , y €z Y , 
and the successor of each element in X is in y, and the successor of each element in Y is in 
X. In the formula above, x -<*^^ z -<*^^ y is of course an abbreviation for {x z)A{z -<*^^ y) 
and likewise for x 2; -<ch y- 

Note that the relations and ^ns are definable, even in FO, from and ^*g: for 
example, 

-(x = y)A{x y) A Vz((x z)A{z^*^^y)^{x = zyy = z)) 

defines the child relation from In MSO one can define from -<ch by stating the 

existence of a path between two nodes (and likewise -<*g from -<ns)- However, it is well- 
known that in FO one cannot define -<*j^ from (cf. |Lib04j l and this is why we chose 
and rather than and ^ns, as our basic relations. However, in all the results 
about MSO, we may assume that the basic relations are ^ch and ^ns- 

In the introduction, we mentioned that we are mostly interested (in this survey) in Boolean 
and unary queries. A Boolean query over trees is just a set of trees closed under isomorphism 
(that is, a query cannot distinguish between two isomorphic trees). A unary query Q is a 
mapping that associates with each tree T a subset Q{T) of its domain. Again, a query is 
required to be closed under isomorphism. 

Definition 2.3 (Definability in logic). Given a logic C, we say that a Boolean query (that 
is, a set T of trees) is definable in C if there is a sentence of C such that T G T iff T |= 
We say that a unary query Q is definable in C if there is a formula ij{x) of C such that 
s S QiT) iff T ^ V'(s)) for every tree T and a node s in T. 
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s At(s) = a 




qi q2 qn-1 qn 



Pa{s) ^ q if qi- ■ ■ q,, e S{q,a) 
Figure 3: Run of an unranked tree automaton 

2.3. Unranked tree automata. A nondeterministic unranked tree automaton, NUTA 
|Tha67l fBMWOlj . over S-labeled trees is a triple A = {Q,F,6) where Q is a finite set of 
states, F C Q is the set of final states, and 5 is a mapping Q x S — > 2^* such that S{q,a) 
is a regular language over Q (normally represented by a regular expression over Q). A run 
of „4 on a tree T with domain Z? is a function PA ■ D ^ Q such that: 

if s is a node with n children, and it is labeled a, then the string 
Pa{s • 0) • • • pa{s ■ (n - 1)) is in 5(p^(s), a). 

This is illustrated in Figure El In particular, if s is a leaf labeled a, then p^(s) = q implies 
that £ € 5{q,a). A run is accepting if G -F, that is, the root is in an accepting state. 

A tree T is accepted by A if there exists an accepting run. We let L[A) denote the set of 
all trees accepted by A. Such sets of trees will be called regular. 

There could be different representations of NUTAs, depending on how regular expressions 
over Q are represented. These issues are discussed in |iNev02. .MNOSj . 

2.4. Binary trees and translations. A binary tree domain is a prefix-closed subset D of 
{0, 1}* such that if s • i G D, then s ■ [1 — i) G D (that is, a node is either a leaf, or both its 
children are va. D). It is common to define (binary) tree automata with both initial and final 
states, using the initial states to avoid conditions e € 5{q, a) imposed in the runs of NUTAs. 
That is, a (binary) nondeterministic tree automaton, NTA, is a quadruple At = (Q, Qo; ^) 
where Q and F are as before, qQ is the initial state, and 5 is a function Q x Q x S — > 2*^. In 
this case a run pA,, on a binary tree T with domain D is a function from D to Q such that if 
s is a leaf labeled a, then PA^is) G 5{qo, qo, a), and if s • 0, s • 1 belong to D, and s is labeled 
a, then PAti^) ^ ^{PAbi'^ ' 0))Pa(^ ' 1))^). As before, a run is accepting if PAti^) ^ ^) ^-^id 
L{Ab) is the set of all binary trees for which there exists an accepting run of Ab- Sets of 
trees of this form are regular sets (of binary trees). 

There is a well-known regularity-preserving translation between unranked and ranked trees. 
It was first used in |Rab69j to show decidability of SwS (but here we shall apply it only to 
finite tree domains) . The idea of the translation is that the first successor in the binary tree 
corresponds to the first child, and the second successor to the next sibling. More precisely, 
we define a mapping 7?. : N* ^ {0, 1}* such that 7^(e) = e, and if TZ{s) = s', where s = SQ-i, 
then Tl{s • 0) = s' • and TZ{so ■ {i + 1)) = s' ■ 1. Or, equivalently, 7^(e) = e, and if 7^(s) = s' , 
then 7^(s • /) = s' • • V . 
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Figure 4: A unranked tree T and its translation TZ{T) 



If D is an unranked tree domain, we let TZ{D) be | s G D} together with 7^(s) • 1 if s 

is a non-leaf last child, and TZ{s) • if s a leaf, other than the last sibling (these additions 
ensure that TZ{D) is a binary tree domain). We define TZ{T) to be a tree with domain TZ{D), 
where TZ{s) has the same label as s, and the added nodes are labeled by a symbol _L S. 
An example is shown in Figure EI 

The following is a folklore result. 

Lemma 2.4. For every NUTA A, there is an NTA Ab such that L{Ah) = {Tl{T) \ T G 
L{A)}, and conversely, for every NTA Ah there is an NUTA A such that the above holds. 

Moreover, Ah can be constructed from A very fast, in Dlogspace |GK+05| . 

Other regularity-preserving translations from unranked trees to binary trees exist. For ex- 
ample, |CNT04j views unranked trees as built from labeled nodes by means of a binary 
operation T@T' that attaches T' at the new youngest child of the root of T. This immedi- 
ately yields a binary tree representation and an automaton construction, and of course an 
analog of Lemma 12.41 holds. 



3. Ordered trees: MSO and its relatives 



In the next two sections we only deal with ordered unranked trees. 

As we mentioned already, MSO is often used as a yardstick logic for trees, because of its 
close connection to regular languages. The following result belonged to folklore, and was 
explicitly stated in |Nev99j . 

Theorem 3.1. A set of unranked trees is regular iff it is definable in MSO. 

When restricted to strings and binary trees, this corresponds to well-known results by Biichi 
|Biic6fl) saying that MSO equals regular languages over strings, and by Thatcher, Wright 
|TWfi8j . and Doner |l)ou7nj . saying that MSO equals regular (binary) tree languages. 

There is also a close connection between automata, MSO, and the common formalism 
for describing schemas for XML documents called DTDs, which are essentially extended 
context-free grammars. A DTD d over an alphabet S is a collection of rules a — > e^, where 
a G S and Ca is a regular expression over E. We shall assume there is at most one such rule 
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for each a € S. A S-labeled tree T satisfies d, if for each node s of T with n children, and 
Xt{s) = a, the string Xt{s • 0) • • • At(s • {n- — 1)) is in the language denoted by ea- We write 
SAT((i) for the set of trees that satisfy d. 

Each DTD is easily definable by an unranked tree automaton: in fact its states just cor- 
respond to labels of nodes. This, however, is too restrictive to capture full definability in 
MSO. In fact, DTDs (that is, sets of the form SAT{d)) are closed under neither unions 
nor complement, which makes DTDs unsuitable for capturing a logic with disjunction and 
negation. 

However, a slight extension of DTDs does capture MSO. An extended DTD over E is a 
triple iTi',d',g) where S' D S, with g being a mapping g : S' i-^ S, and d' is a DTD over 
S'. We say that a S-labeled tree T satisfies {T,',d',g) if there is a S'-labeled tree T' that 
satisfies d' such that T = g{T') (more formally, T is obtained by replacing each label a in 
T' by g{a)). We write Sat(Y!,' , d' , g) for the set of trees that satisfy (E^d^g). 

The following was established in |Tha67j and then restated using the DTD terminology in 

[pvnnllVmnT] . 

Proposition 3.2. A set of unranked trees is MSO definable iff it is of the form Sat(S', d' , g) 
for some extended DTD (Y,',d',g). 

Theorem 13.11 talks about MSO sentences, but it can be extended to unary MSO queries 
using the concept of query automata |NS02j . A (nondeterministic) query automaton over 
unranked S-labeled trees is a quadruple QA = {Q, F, 6, S) where A = {Q, F, 6) is an UNTA, 
and S* is a subset of Q x S. Such a query automaton defines two unary queries Qg^ and 
2qa on unranked trees: 

Existential semantics query: : s G Qqa(^) iff {pa{s)i ^t{s)) G S for some accepting 
run p_4. 

Universal semantics query: : s G Qqa(^) iff {pa{s), Xt{s)) € S for every accepting 
run p^. 

Theorem 3.3. (see |NS02l lNev99[ lFGK03j ) For a unary query Q on unranked trees, the 
following are equivalent: 

(1) Q is definable in MSO; 

(2) Q is of the form for some query automaton QA; 

(3) Q is of the form Qq^ for some query automaton QA. 

Query automata, just as usual tree automata, have a deterministic counterpart; however, 
in the deterministic version, two passes over the tree are required. See (NS02j for details. 

Theorems 13.11 and 13.31 are constructive. In particular, every MSO sentence ip can be ef- 
fectively transformed into an automaton A,p that accepts a tree T iff T ^ 93. Since tree 
automata can be determinized, this gives us a 0(||T||) algorithm to check whether T \= (p, 
if if is fixed. ^ However, it is well-known that the size of Aip (even for string automata) 
cannot be bounded by an elementary function in ||(^|| |SM02j . An even stronger result of 

"'^We use the notation ||r||, ||<^|| to denote the sizes of natural encodings of trees and formulae. 
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|FGn2j says that there could be no algorithm for checking whether T \= ip that runs in time 
0(/(||</5||) • ||r||), where / is an elementary function, unless Ptime=NP. 

Nonetheless, these results do not rule out the existence of a logic C that has the same power 
as MSO and yet permits faster model-checking algorithms. Even looking at a simpler case 
of FO on strings, where results of |FG02j also rule out 0(/(|[(^|[)-|sj) algorithms for checking 
if a string s satisfies 93, with / being an elementary function, the logic LTL (linear-time 
temporal logic) has the same expressiveness as FO |Kam68j and admits a model-checking 
algorithm with running time 

20(IMI) . \s\. 

3.1. Logic ETL. The first logic for unranked trees that has the power of MSO and model- 
checking complexity matching that of LTL appeared in |NSnflj and was called ETL [efficient 
tree logic). It was obtained by putting syntactic restrictions on MSO formulae, and at the 
same time adding new constructors for formulae, which are not present in MSO, but are 
MSO-definable. 

The atomic formulae of ETL are the same as for MSO, except that we are allowed to use 
both -<ch ^-nd ^*[^ and are not allowed to use the next-sibling relation The formulae of 
ETL are then closed under Boolean combinations, guarded quantification, and path formulae. 
The rules for guarded quantification are as follows: 

• if 99(3;, y, X) is an ETL formula, then 3y {x -<ch y A 99) and 3y (x U ^ v) ^I's 
ETL formulae; 

• if (p{x,X) is an ETL formula, then 3X [x X /\ ip) is an ETL formula. Here 
X -<*^T^ X means that X only contains descendants of x. In this case (p cannot contain 
vertical path formulae (defined below). 

Path formulae are defined below, and illustrated in Figure El 

• if e is a regular expression over ETL formulae of the form ip{u,v), then e^{x,y) is 
a (vertical path) ETL formula. The semantics is as follows: T \= e^{s,s') if there 
is a child-relation path s = sq, si, . . . , Sn = s' in T and a sequence of ETL formulae 
ipi{u,v), i < n — 1, such that T \= ipi{si, Sj+i) for each i < n — 1, and the sequence 
tpo . . . tpn-i matches e. 

• if e is a regular expression over ETL formulae of the form 'iIj{u,X), then e~^{x,X) 
is a (horizontal path) ETL formula. Then T \= e^{s,X) if children s ■ i,i < k of s 
can be labeled with ETL formulae ipi{u, X) such that T \= tlji{s ■ i, X) for all i, and 
the sequence ■00 • • • V'fc matches e. 

We also define a slight syntactic modification ETL° of ETL, in which the closure under 
Boolean connectives is replaced by a rule that formulae are closed under taking Boolean 
combinations which are in DNF: that is, if (pij^s are ETL° formulae, then \/- /\- Lp\ - is an 
ETL° formula, where each Lp\ - is either or -^pij. Clearly the expressiveness of ETL° is 
exactly the same as the expressiveness of ETL. 

Theorem 3.4. (see |NSOnj ) With respect to Boolean and unary queries, ETL and MSO 
are equally expressive. Furthermore, each ETL° formula ip can he evaluated on a tree T in 
time 2'^(ll^ll) • ||r||. 
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e^{x,y): ipo ■ ■ ■ fn-i & e e^{x) : Vi ■ ' ■ "^n G e 

Figure 5: The semantics of path formulae of ETL 

ETL formulae can thus be evaluated in linear time in the size of the tree, and double 
exponential time in \\(p\\, by converting Boolean combinations into DNF. It is not known if 
ETL itself admits a 2'^^^^'^^^^ ■ \\T\\ model-checking algorithm. 

3.2. Monadic datalog. Another approach to obtaining the full power of MSO while keep- 
ing the complexity low is based on the database query language datalog (cf . jAHV95j ) ; it was 
proposed in GKdH IGK02j . A datalog program can be viewed as a prolog program without 
function symbols. Datalog is often used to extend expressiveness of database queries beyond 
FO. 

A datalog program consists of a sequence of rules 

H :- Pi, . . . ,Pk, 

where H and all Pj's are atoms: that is, atomic formulae of the form E{x). The predicate 
H is called the head of the rule, and Pi, . . . ,Pk are called its body. Every variable that 
appears in the head is required to appear in the body. Given a datalog program V, predicates 
which appear as a head of some rule are called intensional, and other predicates are called 
extensional. If all intensional predicates are monadic (that is, of the form H{x)), then V is 
a monadic datalog program. 

Given a datalog program V with extensional predicates Pi, ... , Pm and intensional predi- 
cates Hi, ... , Hi, and a structure V = {D, Pf' , . . . , P^) that interprets each p-ary predicate 
Pi as P^ C DP, we define V{T>) as the least fixed point of the immediate consequence 
operator. This operator takes a structure Ti' = {D,H[, . . . ,H'^) and produces another 
structure 7i" = {D, H", . . . , H") such that a tuple a is in H" if it is in H'- or there is a rule 
Hi{x):-~Ri{x, y), . . . ,Rs{x,y) and a tuple h such that for each extensional predicate Ri, the 
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fact Ri(a,b) is true in T>, and for each intensional predicate Ri, the fact Ri{a,b) is true in 

n'. 

A monadic datalog query is a pair (V, H) where "P is a monadic datalog program, and H 
is an intensional predicate. The value of H in V{'D) is the output of this program on T>. 

We consider three unary predicates on unranked tree domains: Leaf, LastChild, and Root. 
Given a tree domain D, they are interpreted as 

Leaf = {s e D \ -<3s' e D : s -<ch s'}, 
LastChild = {s ■ i e D \ s ■ {i + 1) ^ D}, 
Root = {e}. 

Theorem 3.5. (see jGK04n A unary query over unranked trees is definable in MSO iff it 
is definable in monadic datalog over extensional predicates ^fc, ^ns; Leaf, LastChild, Root, 
and Pa, a £ S . 

Furthermore, each monadic datalog query {V, H) can be evaluated on a tree T in time 

om\-\\T\\). 

There are two proofs of this result in |(TKn4j : one codes query automata in monadic datalog, 
and the other one uses the standard reduction to ranked trees and the composition method 
(cf. jHTHZl) for MSO games. 

3.3. /^-calculus. Yet another way of getting a logic equivalent to MSO is suggested by 
a close connection between MSO and the modal ^-calculus L^ on ranked trees, which 
can easily be extended to the unranked case by using the connection between ranked and 
unranked trees. It was shown in |EJ911 INiw88j that every property of infinite binary trees 
definable in MSO is also definable in L^. To deal with unranked trees, we shall define 
L^ over S-labeled structures that have several binary relations Ei,...,Em, cf. |AN01j . 
Formulae of L^ are given by 

(/? := a (a € S) I X I (/9 V (/? I -^f \ 0{Ei)(p \ fj,X (p{X), 

where in fiX ^p{X), the variable X must occur positively in (p. Given a structure T with 
domain D, s £ D, and a valuation v for free variables (such that each v{X) is a subset of 
D), we define the semantics by 

• (T, v,s) \= a iE s is labeled a. 

• (T, v,s) \= LfV (p' iff (T, v,s) \= If or (T, v, s) \= ip'. 

• (T, V, s) \= iff (T, v,s) \= (p is false. 

• iT,v,s) iff s G v{X). 

• {T,v,s) \= 0{Er)p iff {T,v,s') \= p for some s' with {s,s') G Er- 

• (T, V, s) \= fiX ipiX) iff s is in the least fixed point of the operator defined by p>; in 
other words, if 

sGf]{P\{s'\iT,v[P/X],s')^p}CP}, 
where v[P/X] extends the valuation v by v{X) = P. 
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We shall list explicitly binary relations Ei, writing L^[£'i, . . . , Em] to refer formulae that 
only use those relations. An formula (p without free variables naturally defines a unary 
query on trees ({s | {T,s) \= if}) and a Boolean query on trees (by checking if (T, e) \= ip). 

Using the translation into ranked trees (or direct coding of automata), it is easy to show 
the following (see IBLOSj ): 

Proposition 3.6. The class of Boolean MSO queries on unranked trees is precisely the 
class of Boolean queries defined by L^f^fc, ^ns]- 

If we consider unranked trees as structures with relations and ^ns, then they are acyclic, 
and hence the complexity of model checking is 0(||(/3|p • ||r||) |Mat02j . Furthermore, results 
of [Matn2j tell us that one can strengthen Proposition MSO equals alternation-free 
over -<fc, ^ns- For alternation- free formulae over unranked trees the complexity 
of model-checking further reduces to 0(||99|| • ||T||), matching the complexity of monadic 
datalog. 

It is also possible to characterize unary MSO queries over unranked trees in terms of the full 
/x-calculus L^^^ (cf. |Var98j ) which adds backward modalities <>{E~)ip with the semantics 

• (T, s) \= 0{E~)(p iff (T, s') \= if for some s' such that (s', s) G Ei. 

Proposition 3.7. (see BLOSj) The class of unary MSO queries on unranked trees is pre- 
cisely the class of queries defined by L^"[^chi -<ns]- 

There are other fixed-point constructions that have been shown to capture the power of 
automata and MSO over unranked trees; see, e.g. |NS98j . 

4. Ordered trees: FO and its relatives 

We continue dealing with ordered trees, but now we move to logics closely related to first- 
order, as opposed to monadic second-order. 

While a lot is known about FO on both finite and infinite strings, it has not been nearly 
as extensively studied for trees until very recently. Recall that over strings - which we 
can view as trees with only unary branching - FO defines precisely the star-free languages 
(cf. |Tho97j ) ■ and over both finite and infinite strings FO has exactly the power of LTL 
|Kam68j . It can further be characterized by aperiodicity of the syntactic monoid (cf. |Str94j ) . 

In contrast, the natural analog of star- free expressions over binary trees captures not FO 
but MSO |P'r9.Sj . Algebrai c characterizat ions of FO-definable classes of binary trees have 
been obtained very recently ;BSn5(IBWn41IEWn.Sj . with |BSn5j showing that FO-definability 
(without the descendant relation) is decidable for regular tree languages. One well-known 
equivalent logical description of FO on binary trees is Hafer-Thomas's theorem HT87] 
stating that over finite binary trees, FO = CTL* (CTL* is a branching time temporal logic 
widely used in verification, cf. |CGP99j . and it will be defined shortly). Actually, the result 
of |HT87j shows that CTL* is equivalent to MSO with second-order quantification over 
paths only, but over finite trees this fragment of MSO is equivalent to FO. 
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The interest in logics over unranked trees whose power is equal to or subsumed by that of 
FO stems from the fact that navigational features of XPath can be described in FO. XPath 
|CD99j is a W3C standard for describing paths in XML documents. For example, an XPath 
expression 

//a[//6]/c 

produces the c-labeled children of a-labeled nodes having a 6-labeled descendant. Here // 
denotes descendant, / denotes child, and [ ] is a node test. The expression above looks for 
a-nodes (descendants of the root) in which the test [//h] is true (the existence of a node 
labeled h) and from there it proceeds to children of such nodes labeled c. While this is the 
syntax one typically finds in the literature on XPath, here we shall use a different syntax, 
highlighting connections with temporal logics. 

In this section we shall look for connections between XPath, FO on trees, and temporal 
logics, which are designed to talk about properties of paths. 

Logics introduced in the context of studying XPath, and more generally, navigational prop- 
erties of XML documents, can be roughly subdivided into two groups. Firstly, one may 
try to establish analogs of Kamp's theorem (stating that FO = LTL over strings) for trees. 
Secondly, one can try to extend Hafer-Thomas's theorem (the equivalence FO = CTL*) 
from binary to unranked trees. 

Both ways are possible, and in both cases we get FO completeness results, stating that 
some temporal logics have precisely the power of unary FO queries. 

4.1. XPath and temporal logics. We start with LTL-like logics. First, recall the syntax 
of LTL over alphabet S: 

:= a, a G S | V | -199 | X(/9 | X^^j | ipXiip' \ ipSip' . 

Formulae of LTL are interpreted over finite or infinite strings over S: a formula is evaluated 
in a position in a string. Given a string s = aofli . . ., we define the semantics as follows: 

• (s, i) \= a iff Oj = a; 

• {s,i) \= Xip ("next" (p) iff {s,i + 1) \= tp; 

• {s, i) \= X^(/9 iff (s, i — 1) \= ip; 

• (s, i) \= ip\]ip' {ip "until" ip') if there exists j >i such that (s, j) \= pJ and (s, k^\= ip 
for all i < A; < j; 

• the semantics of the dual Lp^ip {ip "since" ip') is that there exists j < i such that 
(s, j) \= ip' and (s, k) \= ip for all j < k < i. 

Note that it is possible to avoid X and X^ by defining a strict semantics for U and S, 
without requiring ip to be true in (s, i). 

We now consider a logic TL*^'^'' (tree temporal logic, cf. |Mar05l ISch92j ) defined as follows: 

ip,Lp' := a,a€S | ipM pJ \ -k/J | X^.^? | X~(/3 | ip\J^(p> \ (p'Si^pJ , 

where * is either 'ch' (child) or 'ns' (next sibling). We define the semantics with respect to 
a tree T and a node s in T: 

• (T, s) ^ a iff Ar(s) = a; 
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• (T, s) \= Xch'/' if {T, s ■ i) \= ip for some i; 

• (T, s) ^ if (T, s') ^ (/? for the node s' such that s' ^ch 

• (T, s) 1= ipUchf' if there is a node s' such that s s', (T, s') |= (/?', and for all 
s" 7^ s' satisfying s s" s' we have (T, s") ^ y?. 

The semantics of Sch is defined by reversing the order in the semantics of Uch, and the 
semantics of Xns,X~,Uns, and Sns is the same by replacing the child relation with the 
next sibling relation. 

rpj^tree ^^turally defines unary queries on trees, and it also defines Boolean queries: we say 
that r ^ 99 if (r,e) ^ 93. 

Theorem 4.1. (see IMarOSj l A unary or Boolean query over unranked trees is definable in 
FO iff it is definable in TL*^'=^ 

In CTL*-like logics, there are two kinds of formulae: those evaluated in nodes of trees, and 
those evaluated on paths in trees. This is similar to the situation with XPath, which has 
filter expressions evaluated on nodes, and location path expressions, which are evaluated 
on paths in XML trees. We shall now present two logics: CTL* with the past, in the spirit 
of jKP95j ■ and a CTL-like reformulation of XPath, as presented in jMarOBj . 

We start with XPath-inspired logics, and present them using a slight modification of the 
syntax that keeps all the main XPath constructions and yet makes the connection with 
temporal logics more visible. 

The language CXPath |Mar05j (Conditional XPath) is defined to have node formulae a and 
path formulae (3 given by: 

a, a' := a,aGS | -la | a\J a' \ E/3 

13,(5' := la \ step | (step/?a)+ | p/(3' | /? V /3' 

where step is one of the following: ^ch; "<di5 ^ns, or -<~g. Intuitively E/? states the existence 
of a path starting in a given node and satisfying /?, ?a tests if a is true in the initial node 
of a path, and / is the composition of paths. 

Formally, given a tree T, we evaluate each node formula in a node (that is, we define 
(T, s) 1= a), and each path formula in two nodes (that is, (T,s,s') \= (3). The semantics is 
then as follows (we omit the rules for Boolean connectives): 

• (T, s) ^ a iff Xt{s) = a; 

• (T, s) ^ E/3 iff there is s' such that (T, s, s') ^ /3; 

• (T, s, s') |=?a iff s = s' and (T, s) \= a; 

• (T, s,s') 1= step iff {s,s') G step; 

• (T, s, s') \= 13/(3' iff for some s" we have (T, s, s") [= (3 and (T, s", s') h 

• (T, s, s') \= (step/?a)+ if there exists a sequence of nodes s = sq, si, . . . , Sk = s', 
k > 0, such that each (sj, Sj+i) is in step, and (T, Sj+i) \= a for each i < k. 

The language Core_XPath IGK+051 IGKPDSj is obtained by only allowing step+ as opposed 
to (step/Ta)"*" in the definition of path formulae. Notice that since step+ = (step /? true)~^ , 
where true = VagE have Core_XPath C CXPath. 
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The earlier example of an XPath expression {//a\//b]/c) can be represented in this syntax 
by a node formula c A E(-<^ /la/ /lb) saying that a node is labeled c, and there is a 
path that starts by going to its parent, finding a there, and then going to a descendant of 
that a and finding a b. 

Core_XPath corresponds to XPath as defined by the W3C |CD99j . while CXPath represents 
an addition to XPath proposed by jMar05^. This addition is essentially the "until" operator 
of temporal logic: for example, to represent the strict version of until (that is, to say that 
in the next element of a path aUb holds), one could write ^ch V (^ch /'^cl)~^ / ^ch /?^- 

Node formulae of either CXPath or Core_XPath naturally define unary queries on trees. 
These can be characterized as follows. 

Theorem 4.2. a) (see |Mar05j ) The node formulae o/ CXPath have precisely the power of 
FO unary queries. 

b) (see |MdR04| ) The node formulae o/Core_XPath have precisely the power of unary FO^ 
queries (that is, FO with two variables) in the vocabulary ^ch, ^ch' ^ns, ^ns- 

Part a) of Theorem 14.21 can also be extended to formulae in two free variables, see |Mar05j . 

4.2. A CTL*-like logic. The logics CTL (computation tree logic) and CTL* are branching 
time temporal logics used in verification of reactive systems. They are normally defined 
without past connectives, but here we use the syntax close to that of [KP9 5 to make it 
possible to reason about the past. In these logics, one also has node (usually called state) 
formulae and path formulae, but path formulae are evaluated on paths, not on arbitrary 
pairs of nodes. 

We define CTLpg^^^ node formulae a, and child and sibling path formulae (3ch and P^s, as 
follows: 

a, a := a (a G S) | -la | a V a' | E/?ch I E/3ns 

/3ch,/5ch '■= « \ ^Pch I /3ch V /?ch I Xch/?ch | X^h-^ch | /?chUch/3ch I /^chSch/^ch 
Pnsi l^-as • — ^ I ~'/^ns | Pns V /S^g | X^s/^ns | -^ns/^ns | /^nsUns/J^g | /3nsSns/3ns 

Given a tree, a child-path TTch is a sequence of nodes on a path from the root to a leaf, and 
a sibling-path is a sequence vTns of nodes of the form s ■ 0, . . . , s ■ {n — 1) for a node s with 
n children. We define the semantics of node formulae with respect to a node in a tree, and 
of path formulae with respect to a path and a node on the path (i.e., we define the notion 
of (T, 7r*,s) 1= for * being 'ch' or 'ns'). 

• (T, s) \= if there exists a path vr* such that s € vr* and (T, vr*, s) \= 13^; 

• (T, TTch, s) \= Xch/? if (T, TTch, s') \= (3, where s' is the child of s on path vTch; 

• (T, TTch, s) \= X^/? if (T, TTch, s') \= j3 where s' is the parent of s on vTch; 

• (T, TTchis) 1= /3Uch/3' if for some s' ^ s such that s' £ TTch and s ~<*^ s', we have 
(T, TTch, s') ^ p', and for all s s" s', s" / s\ we have (T, vTch, s") ^ (3. 

The definitions for Sch and for sibling-paths are analogous. 
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The following can be seen as an analog of the equivalence FO = CTL* for finite binary 
trees |HT87j . While the proof the connection between ranked and unranked tree, the 
straightforward translation from the binary tree fails because paths over translations of 
unranked trees may change direction between child and sibling-paths arbitrarily many times. 

Theorem 4.3. (see |BL05]) A unary or Boolean query over unranked trees is definable in 
FO iff it is definable in CTL* . 



4.3. Extensions of FO and regular languages. Over strings, FO falls short of all reg- 
ular languages, as it defines precisely the star-free ones. However, using arbitrary regular 
expressions is often convenient in the context of navigating in XML documents. 

Given a class C of regular expressions, define FO(C)* as an extension of FO with the rules: 
(i) if e is a regular expression in C over FO(C)* formulae 'ip{u,v), then e^{x,y) is a formula, 
and (ii) if e is a regular in C over FO(C)* formulae ip{u), then e~^{x) is a formula. The 
semantics is the same as for the case of ETL. If formulae if) are restricted to be Boolean 
combinations of atomic formulae P^) £ 5], we obtain the logic FO(C). 

Let StarFree be the class of star-free expressions, and Reg the class of all regular expressions. 

Theorem 4.4. (see INSnOQ a) FO(StarFree) = FO(StarFree)* = FO. 

b) FO(Reg) C FO(Reg)* C MSO. 

For more on FO(Reg) and FO(Reg)* and their connections with fragments of MSO such as 
the path logic |Tho87j . see |BLMn6llN?^ . 

4.4. Conjunctive queries over unranked trees. Conjunctive queries are a very impor- 
tant class of database queries: they correspond to the 3, A-fragment of FO. These are the 
same queries that can be expressed by selection, projection, and join in relational algebra, 
and thus they form the core of database queries. Their complexity had been studied ex- 
tensively. In general, the complexity of evaluating a conjunctive query (p over a database 
T> is in NP, in terms of both the size of ip and the size of T). In fact, the problem is 
NP-hard, and there has been a large body of work on classifying tractable cases (see, e.g., 

iHLsniiinsHnii). 

In the case of unranked trees, conjunctive queries are formulae of the form 

Lp{x) = 3y Ri A . . . A Rk, 

where each Ri is either Pa{z) or z ~< z', where z, z' are variables among x,y, and -< is one of 
^ch; ^ch' ^ns5 or ^*g. We Write CQ(-<i, . . . , ~<m) to denote the class of conjunctive queries 
over unranked trees in which only unary predicates Pa and binary predicates among -<i can 
be used. 

If we restrict ourselves to classes of conjunctive queries that use at most two binary pred- 
icates, then there is a complete classification for the complexity of query evaluation on 
unranked trees. 
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Theorem 4.5. (see |GKSn4j ) The maximal tractable classes of queries CQ(^i, . . . , ^m); 
where all -<i's are among {^ch,-<ch' ^ns,^ns}^ "''^ CQ(^ch, ^ns, ^ns) '^"■^ CQ(^*j^); all 
others are NP-hard. 

In fact, |GKS04j provided a more general (but rather technical) criterion for checking when 
evaluation is in Ptime, and that condition can be used for other relations present in a 
query. 

Conjunctive queries can also be used to capture all FO over unranked tree, even if more 
than one free variable is used, assuming path formulae of CXPath can be used as atomic 
predicates. More precisely, every FO formula ip{x) over unranked trees is equivalent to a 
union of conjunctive queries whose atomic predicates are /?(x, x'), where /3 ranges over path 
formulae of CXPath |Marn5j . 

5. Unordered trees 

In unordered trees, nodes can still have arbitrarily many children, but the sibling ordering 
^ns is no longer available. That is, we view trees as structures 

r=(Z),<h,(^a)a6E), 

where D is a tree domain, -<*^ is the descendant relation, and P^'s define the labels on D. 
Logics considered for unordered unranked trees typically introduce some form of counting, 
see (Rni5linH^inH^II)LMn4llMml3llNm 

A simple explanation for this comes from a modified notion of unranked tree automata and 
query automata for unordered unranked trees. A counting nondeterministic unranked tree 
automaton is a tuple Ac = {Q, F, 6), where, as before, Q is a set of states, and F C Q is a 
set of final states. Let Vq be the set of variables {v^ \ q & Q,k > 0}. Then the transition 
function 5 maps each pair {q,a), for q € Q and a G S, into a Boolean function over Vq. 
A run of A on an unordered tree T with domain D is then a mapping p^^ : D ^ Q such 
that if PAci^) — 1 ^ node s labeled a, then the value of 6{q, a) is 1, where each variable 
Vq. is set to 1 if s has at least k children s' with p^S^') = ^-iid to otherwise. A run is 
accepting if p^d^) £ and the set of unordered trees accepted by Ac (that is, trees for 
which there is an accepting run) is denoted by Lu{Ac)- 

A counting query automaton QAc is defined as {Q, F, 6, S) where S" C Q x S; it selects nodes 
s in a run p where (pAci^)^ ^Tis)) € S. As before, it can be given both existential and 
universal semantics. 

The following appears not to have been stated explicitly, although it follows easily from 
results in |^ev991 ISSMOSj . 

Theorem 5.1. a) A set of unordered unranked trees is MS>0- definable iff it is of the form 
Lu{Ac) for a counting nondeterministic unranked tree automaton Ac- 

b) A unary query over unordered unranked trees is MSO-definable iff it is definable by a 
counting query automaton QAc under either existential or universal semantics. 
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5.1. MSO and FO over unordered trees. Now we look at several alternative characteri- 
zations of MSO and FO over unordered unranked trees that exploit the counting connection. 

Define the counting ^-calculus (cf. jJLOlj ^ as an extension of with formulae 0-^{E)ip. 
The semantics of (T, s) ^ 0^^{E) ip is as follows: there exist distinct elements si, . . . , 
such that (s,Si) € E and {T,Si) \= (p for every 1 < i < k. The next result follows from 
|Wal02j . as was noticed in |JL01j : 

Theorem 5.2. Over unordered unranked trees, MSO and C^[^ch] have precisely the same 
power with respect to Boolean queries. 

In fact, it is not hard to show that MSO can be translated into alternation-free C^, and 
thus evaluated with complexity 0(||T|| • \\<f\\), where if is an alternation-free formula. 

For first-order logic, counting extensions of both the temporal logic TL*''^'^ and CTL* give us 
analogs of Kamp's and Hafer-Thomas's theorems. We define TL^q^^^^ as a version of TL*'^'^'^ 
in which only modalities for the child relation are used, but in addition we have formulae 
X^j^99, with the semantics that (T, s) \= ^^h^ there are at least k children s' of s such 
that (T, s') ^ ip. 

We also extend CTL* with counting. In this counting extension CTL*^^^^, we have new 
state formulae EX^j^a, where a is a state formula, with the same semantics as above. 

Theorem 5.3. (see |MR03l rSch92j ) Over unordered unranked trees, the classes of Boolean 
queries expressed in FO, TL^J^^^^, and CTL^q^^^ over binary relation ^ch, the same. 

For unary queries, the equivalence FO = TVJ^^^^ still holds |Sch92j . and FO can be shown 
to be equivalent to an extension of CTL* with both counting and the past ^BL05 , Rab02j . 

Adding counting does not increase the complexity of model-checking in temporal logics, 
which is 2<^(ll^ll) • ||r||, cf. ;"C€P99 . 

Unordered fragments of XPath have also been looked at in the literature. For example, 
|BFK03j showed that the restriction of positive (no negation) Core_XPath that only uses 
^ch ^-nd is equivalent to existential positive FO formulae over the vocabulary that 
includes both and ^*^. 

5.2. Extensions and more powerful counting. Consider now a scenario in which we 
deal with unordered trees, but in our formulae we can refer to some arbitrary ordering on 
siblings: after all, in any encoding of a tree, siblings will come in some order. Of course we 
do not want any particular order to affect the truth value, so we want our formulae, even if 
they use an ordering, to be independent of a particular ordering that was used. 

This is the standard setting of order-invariance, a very important concept in finite model 
theory, cf. jLibO^. We say that an MSO sentence ip over vocabulary including ~<*^ and 
is -^ns-ii^variant if for any unordered tree T and any two expansions T"^ns and T"^ns with 
sibling-orderings -<\^ and -<\^ we have T^n^ \= ip r^ns ^ Any ^ns-invariant sentence 
defines a Boolean query on unordered trees. 
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We now define MSOmod [ConQDj as an extension of MSO with modulo quantifiers: for each 
set variable X, and k > 1, we have set new formulae Qk{X) which are true iff the cardinahty 
of X is congruent to modulo k. 

Theorem 5.4. (see |(yOu91j ) Over unordered unranked trees, -K^s-invariant Boolean queries 
are precisely the Boolean queries definable in MSOmod- 

Further extensions in terms of arithmetic power have been considered in 'SSM03' ISS+041 . 
Recall that Presburger arithmetic refers to the FO theory of the structure (N, +), and it is 
known that this structure admits quantifier elimination in the vocabulary (+,<,0, l,(~fc 
)k&i) where n ~fc m iS n — m = O(mod k). We next define Presburger MSO, called PMSO, 
as an extension of MSO over unordered trees with the following rule: if (p{x, y, X) is a PMSO 
formula and a{v) a Presburger arithmetic formula with \X\ = \v\ = n, then [(p/a]{x,y,X) 
is a PMSO formula. Given valuation s, sq, S for free variables, with S = {Si, . . . , Sn), let 
rui be the number of children of sq that belong to Si, that is, the cardinality of the set 
{s' I So ^ch s' and s' € Si}. Then [ip/a]{s, sq, S) is true iff a{mi, . . . , rUn) is true. 

It is easy to see that MSO C MSOmod £ PMSO over unordered trees. Still, PMSO is 
captured by a decidable automaton model. 

Define Presburger unordered tree automata just as counting automata except that 6 maps 
pairs from QxT, into Presburger formulae over Vq, for q €z Q. We interpret Vq as the number 
of children in state q, and a transition is enabled if the corresponding Presburger formula is 
true in this interpretation. That is, in a run p of such an automaton, if p{s) = q, the label 
of s is a and 5{q, a) = i • • • > Vg^), then xC^-i) • • • > ^m) is true, where rij is the number 
of children s' of s such that p{s') = qi. 

Theorem 5.5. (see [SSMDSj ) Presburger unordered tree automata and PMSO are equiva- 
lent. Furthermore, both emptiness and universality are decidable for Presburger unordered 
tree automata. 

Further extensions with counting have been considered for fixed-point logics ISS+041 and 
the ^-calculus with modulo-quantifiers [BLDSj . 

5.3. Edge-labeled unordered trees. While in the early days of tree-based data models 
there was some debate as to whether labels should be on edges or nodes, the arrival of XML 
seems to have settled that dispute. Nonetheless, there are several areas where edge-labeled 
trees play a prominent and role, and traditionally logical formalisms have been designed 
for such data. First, there are logics for feature trees, which are a special case of feature 
structures used extensively in computational linguistics |Car92j . Second, in recent work on 
spatial logics, used for describing networks and mobile agents |CG00j . one looks at modal 
logics over unordered edge-labeled trees. 

In the setting of feature trees, one has an infinite set of features J^, and in an unordered 
unranked tree every edge is labeled by an element f € J-' such that each node s has at 
most one outgoing edge labeled / for each f £ J^. Furthermore, nodes may be labeled 
by elements of some alphabet S, as before. It is thus natural to model feature trees as 
structures {D, {Ef)f^jr, {Pa)aeT.) such that the union of all -E/'s forms the child relation of 
a tree, and no node has two outgoing Ef-edges. 
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In the context of computational linguistics, one commonly used logic for feature trees |Bla94j 
is the propositional modal logic that, in the context of feature structures (not necessarily 
trees), is also often supplemented with path-equivalence, stating that from a certain node, 
one can reach another node following two different paths. This is the setting of the Kasper- 
Rounds logic RK86 . Over trees, however, path-equivalence is the same as equality of 
paths. A more powerful logic proposed in |Kel93j combined the Kasper-Rounds logic with 
the propositional dynamic logic. Its formulae are defined by 

:= a, a G S | (/? V 93' | -193 | ^{e)ip \ e e' , 

where e, e' are regular expressions over J^. Formulae are evaluated in nodes of a feature tree 
T. We have (T, sq) \= C>(e)(^ if there is a path (sq, si) E Ef^, (si, S2) G Ef^ , ■ ■ ■ , (sn-i, Sn) G 
^U-i such that (T, s„) \= ip and fofi ■ ■ ■ fn~i is a word in the language denoted by e. 
Furthermore, (T, s) ^ e ~ e' if there is a node s' that can be reached from s by a word in 
e as well as a word in e'. This semantics is normally considered over graphs, but over trees 
this is equivalent to saying that there is a node reachable by an expression in the language 
denoted by eCie'. That is, e ^ e' is equivalent to 0{ene')true, and thus the Kasper-Rounds 
logic is effectively a reachability logic over trees. 

The reader is referred to |Kel93j for computational linguistics applications of this logic. 
In terms of expressiveness it is clearly contained in MSO, and if all expressions e, e' are 
star-free, then in FO as well, as long as we have the descendant relation. 

Automata for feature trees, based on the algebraic approach to recognizability |Cou90j . 
were considered in |NP93j (which also showed that over flat feature trees the automaton 
model coincides with a simple counting logic). 

5.4. An ambient logic for trees. Ambient logics are modal logics for trees that have 
been proposed in the context of mobile computation ICGODj and later adapted for tree- 
represented data ICarOllinncn] . O ne views trees as edge-labeled and defines them by the 
grammar 

T,T' := A I T\T' \ a[T], a E S, 

with the equivalences that | is commutative and associative, and that T\A = T. Here A 
is the empty tree, | is the parallel composition, and a[T] adds an a-labeled edge on top of 
T. If we extend = to a congruence in the natural way, then every tree is equivalent to one 
of the form ai[ri]| . . . |am[7»rt], which is viewed as a tree whose root has m outgoing edges 
labeled oi, . . . , Cm, with subtrees rooted at its children being Ti, . . . , Tm- 

There were several similar logics proposed in |(XX;n3l ICarfTTl KX^OI I KX^nOl IDT.Mn4j . Here 
we consider the logic from CCGOS^ whose formulae are given by 

ip,ip' := -L I A I Lp A ip' I -193 I (p\(p' I ip>ip' I a[ip] \ ip@a, a € S. 

The semantics is as follows: 

• _L is false; 

• A is only true in a tree equivalent to A; 

• T \= ipi\ip2 iS T = Ti\T2 with Tj |= c^j, i = 1,2; 

• T \= ip> ip' if for every T' with T' \= ip we have T\T' \= ip'; 

• r ^ a[ip] iST = a[T'] with T' h 
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• T\= ip@a iff a[T] \= ip. 
Variations appear in the literature, e.g. with the Kleene star in |DLMfl4j and recursion in 

mm- 

The study of ambient logics for trees took a very different path compared to other logics seen 
in this survey; in particular, the focus was on type systems for tree languages and thus on 
proof systems for logics, rather than model-checking, its complexity, automata models, and 
comparison with other logics. Several lines of work closely resemble those for node-labeled 
trees: e.g., |DLM04j introduced Presburger conditions on children, defined an automaton 
model, and proved decidability, similarly to |SSM03l |SS+04| . 

However, the ambient logic does not take us outside of the MSO expressiveness: this can be 
seen by going from edge-labeled trees to node-labeled ones. The translation is simple: the 
label of each edge {x, y) becomes the label of y. The root will have a special label Root that 
cannot occur as a label of any other node. The only modification in the logic is that now we 
have formulae Aq for a G S, which are true in a singleton-tree labeled a. The resulting logic 
is easily translated into MSO. For example, (p\(p' states that the children of the root can be 
partitioned into two sets, X and X', such that the subtree that contains all the X-children 
satisfies ip and the subtree that contains all the X'-children satisfies ip' . For (p>ip' , one can 
consider -1(99 1> 99') saying that there exists a tree T' such that T' \= ip and T\T' \= -199', and 
use nondeterministic counting automata to guess this tree T'. 

Since moving labels from edges to nodes and back can be defined in MSO, we see that 
the ambient logic is embedded into MSO. However, to the best of the author's knowledge, 
this direction has never been seriously pursued, and the exact relationship between ambient 
logics and other logics described in this survey is still not well understood. 

6. Automatic structures 

In this section we look at a different kind of logics for unranked trees, using the standard 
approach of model theory. So far we represented each tree as a structure and looked at 
definability over that structure. Now we want to consider structures whose universe is the 
set of all trees. Definability over such structures allows us to describe sets of trees and, 
more generally, relations over trees. Choosing the right operations on trees, we shall find 
structures where definable sets are precisely the regular languages. Such structures are very 
convenient for proving that certain properties of trees are regular, as it is sometimes easier 
to define properties logically than to construct automata for them. 

Let Tree(S) be the set of all S-labeled unranked trees. We consider structures of the form 
9Jl = (Tree(S), Q) where $7 is a set of relation, constant, and function symbols, interpreted 
over Tree(S). 

Let Def„(9Jl) be the family of n- dimensional definable sets over OJt: that is, sets of the form 

{f G Tree(S)" I m h ^{f )], 

where ip{xi, . . . first-order formula in the vocabulary $7. We shall be looking at 

structures OJt so that definable sets would be relations definable in MSO or other logics. In 



22 



L. LIBKIN 



particular, such relations will be given by automata, and thus structures DJl of this kind are 
called automatic structures. 

6.1. Automatic structures on strings. Before we move to trees, we first survey auto- 
matic structures over strings, cf. [BGOOl |BL+03| . In this case we consider structures of the 
form (S*,r2). Our first example has the following relations in fi: 

• -< is a binary relation; s ^ s' is true iff s is a prefix of s'; 

• La, a G S, is a unary relation; La{s) is true iff the last symbol of s is a; 

• el is a binary relation; el(s,s') is true iff |s| = 

Let Suniv be the structure (S*, ^, (La)aGS, el). Then Suniv is the universal automatic 
structure: that is, relations Defn (Suniv) are precisely the regular relations. Following a 
standard definition - see, e.g., - we say that a relation S C (S*)" is regular iff 

there is an automaton A over alphabet (S U {#})" that accepts precisely the strings [s], 
for s = (si, . . . , Sn) € S. The length of [s] is maxj |sj|, and the jth symbol of [s] is a tuple 
((Ti, . . . , (Tn), where dj is the jth symbol of Si if \si\ < j , and # otherwise. 

Thus, DGfi (Suniv) contains exactly the regular languages over S. Furthermore, the con- 
version of formulae over Suniv fo automata is effective jBOOOj and the theory of Suniv 
is decidable. In fact the theory of every structure that is interpretable in Suniv is thus 
decidable. 

As an example, consider the structure (Q, <). Since it is isomorphic to ({0, <iex), 
where <iex is the lexicographic ordering (which is easily definable in Suniv); we obtain the 
well-known decidability of (Q, <). 

A restriction of Suniv that does not have the equal length predicate, that is, S = (S*,^ 
, (La)ags) is known to be strictly weaker that Suniv in every dimension: in particular, el is 
not in Def2(S), and Defi(S) is precisely the class of star-free languages |BL+03| . 

Notice that both the empty string e and functions ga{s) = s-a are definable in S, and hence 
another well-known theory interpretable in S and Suniv is that of unary term algebras. 
However, it is known that for binary term algebras, adding relations like -< results in 
undecidable theories 'MNT981 IVen87j . In particular, if we want to keep an analog of the 
^-relation (which is MSO-definable), we cannot introduce an operation like the | operation 
in the ambient logic, and still have a decidable theory. 

6.2. Automatic structures on trees. To get structures over Tree(S) that define regular 
languages and relations^, we find natural analogs of ^, La, and el for trees. For two trees 
Ti and T2 with domains Di and D2 , we say that T2 is an extension of Ti , written Ti :^T2, 
if Di C D2, and the labeling function of T2 agrees with the labeling function of Ti on Di. 
It will actually be more convenient to work with two extension relations: 

Extension on the right : For Ti T2, we require that every s S D2 — Di 

be of the form s' ■ i when s' ■ j ^ Di for some j < i. 



'The notion of regular relations for trees is obtained in the same way as for strings 
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Extension down : For Ti we require that every s € D2 — Di have a prefix 

s' which is a leaf of Ti . 

Clearly Ti ^ T2 iff there is T' such that Ti T' and T' ^ j T2, so in terms of definability 
we do not lose anything by using and instead of ^. 

We define La to be true in a tree T if the rightmost node is labeled a. That is, the node 
s (z D which is the largest with respect to <iex is labeled a. For the analog of el, recall 
that in the standard representation of strings as first-order structures, the domain is an 
initial segment of N, corresponding to the length of the string. Hence, el(si, S2) means that 
if strings are represented as structures, their domains are the same. We thus introduce a 
predicate ~dom such that Ti ~dom ?2 ifi^ Di = D2 (there Di is the domain of Tj). 

Now we define analogs of ©univ and ©: 

Tuniv = (TREE(S), ^i, {La)aeT., ~dom) 

1 = (TREE(S), {La)ae^) 

Theorem 6.1. (see jBLN06]) a) For 6V6Ty 77- ^ 1, D0f^(*Xuniv) '^^ pTcciscly thc class of 
regular n-ary relations over Tree(S). 

h) Defi(T) = Defi(Tuniv) is the class of regular unranked tree languages, but for every n > 1, 
DeUi'^) £ Def„(Tu„iv). 

Notice the difference with the string case, where removing el (domain equality) resulting in 
a smaller class of one-dimensional definable sets: star-free languages. On the other hand, 
even over binary trees, the notions of star- free and regular coincide |PT93j . 

Working with luniv makes it easy to write rather complicated properties of tree languages, 
and then Theorem 16.11 implies that those languages are regular. For example, if X C 
Tree(S) is regular, then the set of trees T such that all their extensions can be extended 
on the right to a tree in X is regular. Indeed, this is defined as if{T) = \/T'{T ^ T' ^ 
3T"{T' T" A axiT"))), where ax defines X (by Theorem 16.11 we know such ax 
exists). Then Theorem 16.11 again tells us that ip defines a regular language. Furthermore, 
the conversions from formulae to automata are effective for both T and Tunivj which implies 
decidability of their theories. 

Other logics over unranked trees can be naturally represented over these structures. Con- 
sider, for example, a restriction of first-order logic over 1. or Tuniv in which all quantification 
is over branches. A branch is a tree T such that the set {T' \ T' ^ T} is linearly ordered by 
:<. Let Def^ be the class of sets of trees (equivalently, Boolean queries over trees) definable 
in this restriction. 

Proposition 6.2. (see |BLN06j ) Def^(T) is precisely the class of FO-definable Boolean 
queries over unranked trees, and Def^(Tuniv) is the class of Boolean queries definable in a 
restriction of MSO in which quantification is allowed only over sets linearly ordered by ~<*^ 
or by ^*,. 

For more results of this type, see jBLNfld] . 
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6.3. A different view of unranked trees. We conclude by presenting a different view 
of unranked trees and a different structure for them that makes it easy to talk about about 
their extensions in which new children may be inserted between existing ones. For example, 
if we have a tree T with domain D = {e, 0, 1}, and we want to add more children of the 
root, they would have to be added on the right, e.g, we may have an extension with domain 
{e, 0, 1, 2, 3}. But what if we want to add a child on the left of 0, and two children between 
1 and 2? Intuitively, we need a new tree domain {e, — 1, 0, ^, |, 1} then. We now capture 
this situation and present a different automatic structure that makes it easy to derive that 
certain relations on trees are regular. 

A rational unranked tree domain is a finite prefix-closed subset of Q*. Relation -<*-^ is 
defined for rational domains just as before, and relation -<*g is now given by s ■ r s ■ r' 
iff r < r' . Then an unranked tree T over a rational unranked tree domain is, as before, a 
structure T = {D, ~<*^, (Pa)aGs)- 

Let Treeq(S) be the set of all unranked trees with rational unranked tree domains. Note 
that different trees in Treeq(I;) may be isomorphic; we denote this isomorphism relation 
by =. There is a natural one-to-one correspondence between TreEq(S)/ = and Tree(S). 

We define the extension relation ^ over trees in TreEq(5]) as before. A branch, again, is 
a tree T G TreEq)(E) such that the set {T' \ T' :< T} is linearly ordered by ^. It follows 
from the definition of rational unranked tree domains that the domain of a branch consists 
of all the prefixes of some string s G Q*; i.e., it is completely determined by s, which is its 
unique leaf. Let La{T) be true iff T is a branch whose leaf is labeled a, and let Ti <iex ^2 
be true iff Ti and T2 are branches with leaves si and 52, and si <iex S2- We then define the 
structure 

'l!^^^ = (TREEq(S), ^, <iex, ^dom, (^a)a6s)- 

In this structure it is much easier to reason about tree extensions that allow one to in- 
sert nodes between existing ones, and not only on the right or under the leaves. But 
what about definable sets and relations over Ty^jy? It turns out that they are all regu- 
lar. More precisely, we can interpret 1^^^^ in Tuniv: that is, find a set X G Defi(luniv), 
binary relations Ri,R2,R3 G Def2 (Tuniv) and sets Ya G Defi (Tuniv)) a G S, such that 
{X, Ri, i?2, -R3, (^)aes) is isomorphic to T^^;^. That is, we have: 

Proposition 6.3. The structure T^^j^ is interpretable in Tuniv Furthermore, there is a 
definable subset of the image of TreEq(S) that contains exactly one representative of each 
=-equi valence class. 

That is, under the mapping l : TreEq(S)/ =^ Tree(S), definable sets and relations 
over T^j^-y become precisely the regular tree languages (and relations). Hence, expressing 
properties of unranked trees in first-order logic over T^j^j^ allows us to conclude easily that 
certain tree languages are regular, and thus MSO-definable. 
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<a> 

<b> 

<a></a> 
<c></c> 
<b></b> 
</b> 
<a> 

<b></b> 
<c></c> 
</a> 
</a> 

Figure 6: An XML document as a tree and as a sequence of tags 

7. Other directions and conclusions 

We present here a somewhat random sample of other directions that work on logics for un- 
ranked trees has taken or may take in the future. We concentrate on streaming applications, 
and then briefly describe other directions. 

Streaming XML documents. A typical XML document is a sequence of matching opening 
and closing tags, with some data between then. For example, the sequence of opening and 
closing tags corresponding to a tree is shown in Figure Thus, an XML tree naturally has 
a string representation. For example, for the tree in Figure IHl such a representation is 

abaaccbbbabbccaa, 

where we use a label, say a, for the opening tag <a>, and a for the closing tag </a>. More 
generally, for an ordered unranked tree T we define inductively its string representation 
str(r): 

• if T is a single node labeled a, then str(T) = aa 

• if r has a root labeled a, with n children sq ^ns • • • ^ns Sn-i, such that Tj is the 
subtree rooted at Sj, i < n, then str(T) = a str(To) . . . str(T„_i) a. 

If an XML document T is transmitted as a stream, then the object we work with is precisely 
str(r). Furthermore, we may not have the whole string str(T) available, or may need to 
compute some of its properties without looking at the whole string (for instance, a device 
receiving the stream may have memory limitations and cannot store the entire stream). 
One possible model for this scenario was proposed in |S V02j : in this model, one processes 
the stream str(T) by using a finite string automaton. It is natural to ask then what kinds 
of properties of trees can be recognized by finite automata that run on their streamed 
representations. More precisely, one is interested in tree languages of the form 

L^r = {T I str(r) is accepted by A}, 

where ^ is a string automaton. 
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This question has been primarily addressed in the context of DTD vaUdation. Namely, 
given a DTD d, is it possible to find an automaton Ad such that 

L% = SAT(d)? 

In general, the answer is negative, as was shown in |SVn2j . We now sketch a very simple 
proof of this. Consider the following DTD di: 

a ^ ab \ ca \ e, 6 — >• e, c ^ e. 

Suppose SAT((ii) = for some A. The regular language given by A is definable in MSO, 
say by a sentence of quantifier rank r. Choose numbers n and k so that a" and a""'"'' cannot 
be distinguished by MSO sentences of quantifier rank r, and consider two strings: 

si = a" (acc)" aa a" {bba)"- 
S2 = a"+^ (accf aa 0"+^= {bba)'' 

which in turn (by a standard composition argument, see, e.g., |Lib04l rrho97j l cannot be 
distinguished by A. One clearly has si = str(Ti) for some Ti G SAT(di), and S2 = str(T2) 
for a tree T2 G SAT(d2) — SAT((ii), where ^2 is 

a —f a \ ab \ ca \ e, 6 — > e, c — > e, 

which contradicts the assumption SAT((ii) = Lj^. 

While |SVn2j provides many results on streamed validation of DTDs, the problem of char- 
acterizing DTDs that can be checked by finite automata over streamed representations 
remains open. Such a characterization can be found for MSO-definable properties as fol- 
lows. Given an MSO sentence (p over ordered unranked trees, we say that ip is streamable 
if {T I T 1= 99} is of the form L^'' for some finite string automaton A. 

Let s be a node in a tree T; define rma(s) (the right-most ancestor) to be the smallest prefix 
of s such that each node s' with rma(s) ^ s' ^ s is the largest in the ordering. This 
naturally defines a string of labels, by collecting all labels of nodes between rma(s) and s. 
We denote this string by rms(s). For example, if s is the rightmost node in the tree shown 
in Fig. El then rms(s) = aac. Finally, for each regular language L over strings, we write 
f/rms^g) iff rms(s) G L. 

The following is due to Segoufin and the author. 

Proposition 7.1. An MSO sentence 99 over ordered unranked trees is streamable iff it is 
expressible in MSO over the vocabulary that includes ^fc, {Pa)aGT:, and f/™^, where L 
ranges over regular languages. 

However, the decidability of checking whether an MSO sentence belongs to the fragment of 
Proposition 17.11 remains open. 

Some recent results on processing queries over streaming data (especially XPath queries) 
can be found in IBY+051 IHK?^ . 
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Future directions and open questions. 

1) This survey has concentrated primarily on Boolean and unary queries. While these 
are sufficient in many applications, there are formalisms that require more general n- 
ary queries. For example, the core expressions of XQuery can be seen as rearranging 
arbitrary n-tuples of nodes selected from a tree as another tree. The logical study 
of XQuery is just beginning I KocOSj . and there are several papers that show how to 
extend results from logics that define Boolean and unary queries to arbitrary n-ary 
queries. For example, |Schnflj does it for queries definable in FO(Reg)-like logics. 
Using a similar approach, |ABLfl6j shows how to combine temporal logics over trees 
to define n-ary queries. An extension of unranked tree automata to n-ary queries is 
presented in |NP+05| . 

2) While we have a number of logics that provide a declarative approach to expressing 
properties of trees and yet match (or are close to) the complexity of the procedural 
automata formalism, it is not really understood what causes certain logics to have 
such a nice behavior. There must be some intrinsic properties of logics that lead 
to good model-checking algorithms (in a way similar to, say, finite- or tree-model 
properties being an explanation for decidability). 

3) Closely related to the first item is the issue of succinctness of logics, measured as 
the size of formulae needed to express certain properties. Initial investigation on the 
issue of succinctness for logics on ranked trees was done in |GS03j and some logics 
have been shown to be much more succinct than others, but more needs to be done. 
In view of the standard translation between ranked and unranked trees, it is likely 
that results for binary trees will be sufficient. 

4) The connection between FO, MSO, temporal logics and logics used in the program- 
ming languages and computational linguistics communities must be understood. 
The focus was quite different, as we mentioned earlier: for example, many questions 
about the complexity and expressiveness of ambient logics are unresolved. Some 
very recent results in this direction are reported in BTT05 . 

5) XML trees in addition to labels have data values associated with some nodes (typ- 
ically attribute values or PCDATA values). Adding values from a potentially infinite 
set and just equality over them immediately leads to undecidable formalisms. This 
is observed, in particular, in the study of XML constraints. Some typically consid- 
ered constraints include keys and foreign keys, that arise naturally when relational 
data is converted into XML. Keys say that a certain sequence of attributes iden- 
tifies a node uniquely. A key is unary if it consists of one attribute (for example, 
a unique id would be a unary key, while a pair (firstname,lastname) can be a key 
consisting of two attributes). A foreign key states that a sequence of attributes of 
each node labeled by ai should also occur as a sequence of attributes of some other 
node labeled 02. 

XML specifications may consist of DTDs together with constraints. However, 
their interaction could be quite complicated. In fact, I FL02j showed that it is unde- 
cidable whether a specification that consists of a DTD and a set of keys and foreign 
keys is consistent. However, if all keys and foreign keys are unary, then consistency 
checking is NP-complete. 

It would be nice to find a purely logical explanation for this type of results. 
Decidability restrictions studied in (NSVOL BPTOl] are very weak for this purpose. 
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However, a recent line of results shows much more promise. Consider trees that can 
carry data values, and assume that we can test them for equality, that is, we have 
a binary relation ~ that is true if two nodes in a tree have the same data values. 
Then FO^ over such trees with the ~ relation and the successor relation is decidable 
|B+06a| . Here FO^ refers to FO with two variables. Notice that for expressing unary 
constraints two variables suffice. It is open whether the descendant can be added 
while preserving decidability; the only resolved case is that of strings, where indeed 
FO^ over the successor relation, the linear ordering, and the ~ relation is decidable 
|B+06b| . 
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